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TITLE: Quality of Service Monitor for Multimedia Communications System 

20 BACKGROUND - FIELD OF INVENTION 

This inv^tion relates to methods of estimating the subjective quality of a multimedia 
communications system in which audio, voice or video is digitized, compressed, formed into 
packets, transmitted over a packet network and then re-assembled and decoded by a receiving 
25 system. Typical packet networks cause some packets to be lost or delayed which results in the 
quality of the decoded audio, voice or video being degraded and it is accordingly desirable to 
have some means of measuring or estimating the subjective or perceptual quality of the decoded 
audio, voice or video. 

Emerging packet based voice networks, using technology such as Voice over IP (Intemet 
30 Protocol), provide a more flexible and lower cost alternative to traditional teleconmxunicadons 
networks. They do however introduce some problems, notably increased variation in user 
perceived speech quality due to network impairments. The present invention relates to methods 
for estimating this variation in user perceived quality. 

A Voice over IP system comprises two or more conversion points and a connecting network, A 
35 conversion point is a device that converts analog voice into packet format suitable for 

transmission over a network. A conversion point may be a device within a telephone switching 
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system, a packet voice telephone, a personal computer running an applications program or other 
types of device. The following brief description may be referenced to the illustrative diagram 
shown in Fig. 1 . At each conversion point the analog voice signal &om the xiser's telephone is 
converted to a digital form, divided into short segments, compressed, placed into an IP packet 
5 and then transmitted over the connecting network to the remote conversion point. Received 
voice packets are uncompressed, converted back to analog form and played to the user as an 
audible signal. 

The connecting network relays the IP packets fiom one conversion point to ano&er. The 
network is a shared resource and is carrying many other streams of packet data. This means that 
1 0 any given packet may be subject to impairments, for example: 

(i) Delay, in which the time for the packet to get from one conversion point to the other 
conversion point causes delays in the apparent response from one user to the other 

(ii) Packet loss, in which some of the packets are lost or arrive so late that they are discarded 

(iii) Jitter, in which the arrival time of the packets varies 

1 5 (iv) Distortion, due largely to the voice compression algorithm in use. 

These impairments collectively cause the user perceived voice quality to vary considerably and 
hence Voice over IP service providers need a method for estimating the quality of service 
provided by ttieir network (Voice Quality of Service). 

BACKGROUND - DESCRIPTION OF PRIOR ART 

20 Prior art systems for measuring voice quality, as described by Douskalis (Hewlett Packard 2000), 
Royer (US 5,710,791) and Di Pietro (US 5,867,813), use centralized test equipment which 
samples the voice quality from various conversion points. A loop back condition is established 
at an conversion point, the test equipment transmits a known signal and then compares the 
received (looped back) signal with the original, thereby estimating delay, distortion and other 

25 impairments. This approach provides an accumte measure of voice distortion but only provides 
tibis measure for a sample conversion point and under the network conditions that existed at the 
time of the test. This £^proach is undesirable for continuous network monitoring as tiie frequent 
transmission of test messages increases the traffic in the network and reduces network 
p^ormance. 
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Another approach currently used for estimating voice quality is to estimate the subjective 
performance of the voice connection using objectively measured parameters. Models such as the 
E-Model described by Johannesson, (IEEE Communications Magazine 1997), are able to 
produce R ratings which can be correlated to user perceived voice quality. This process is 
5 applied by a central management system which gathers statistics on noise and delay and then 
produces an estimate of voice quality. This method as described by Johannesson does not 
consider impairments typical of packets systems. 

E:}qperimental measurements of the effects of network impauments on packet voice quality are 
reported by Cennak (TlAl contributions May 1999 and June 1998). Cermak considered the 
10 effects of average packet loss but did not consider the effects of the time varying nature of 
impairments on subjective quality. 

The QuaU.Net system marketed by ECTel comprises a central test system with additional remote 
test units. The remote test units are complex units that contain dedicated electronic circuitry and 
software and are constructed as separate items of test equipment that are externally attached to a 

15 Voice over IP system. The remote test units estimate voice quality on selected voice connections 
and report this to the central test system for diagnostic purposes. The high cost of these remote 
test units means that it is prohibitively expensive to install one for every voice connection and 
therefore only a small number are typically employed within a network. The Quali.Net system 
does not contain a statistical modeling process that analyzes the burst nature of packet loss and 

20 its effects on subjective voice quality. The Quali.Net system does not compute the estimated 
subjective voice quality within the Voice over IP end system, cannot effectively monitor tiie 
voice quality at every port simidtaneously and cannot provide per-cail voice quality information 
that can be recorded within a call record database. 

The NeTrueQoS system marketed by NeTrue comprises a central test system with remote 
25 software agents. The software agents gather network statistics and report packet loss, jitter and 
delay back to the central system which computes an estimated voice quality. Said software 
agents are located within a Voice over IP Node, which comprises a piece of equipment that 
supports multiple Voice over IP ports. The NeTrueQoS system does not contain a statistical 
modeling process that analyzes the burst nature of packet loss and its effects on subjective voice 
30 quality. The NeTrueQoS system does not compute the estimated subjective voice quality within 
the Voice over IP end system and therefore cannot effectively monitor the voice quality at every 
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port simultaneously and cannot provide per-call voice quality information that can be recorded 
within a call record database. 

Prior art systems for estimating voice quality based on measurements of network performance 
therefore suffer fi:om a number of drawbacks: 

5 (i) The use of the statistics gathered independentiy over a period of time does not reflect 

the time correlation between the statistics. If a high level of jitter coincides in time with a 
high level of packet loss then this will have a different subjective effect to the same 
impairments occurring at different times. Prior art centralized systems for estimating 
voice quality based on network statistics do not precisely correlate the times at which 

10 impairments occur and therefore do not accurately estimate voice quality. 

(ii) Typical voice coding algorithms employed in packet voice systems compensate for 
lost packets by repeating the last packet, estimatmg the content of titie lost packet or 
inserting noise. For single lost packets this approach is very effective and voice quality 
affected only slightly. When more than one subsequent packet is lost the voice coding 
15 algorithm will replay the last received packet multiple times, which is much more 

noticeable to the user. Prior art systems do not represent the way that bursts of lost 
packets affect voice quality and therefore do not accurately estimate voice quality. 

Prior art systems for estimating voice quality do not properly support Service Level Agreements. 
Telephone service providers employmg Voice over IP technology are desirous of offering 
20 Service Level Agreements in which they provide guarantees of voice quality, network 

availability and price. In order to properly implement such Service Level Agreements it is 
preferable to monitor every call and to record information on voice quality within call records. 

Prior art systems do not support packet video systems, which also suffer fix)m similar 
degradation due to the inability of the video decoder to fully reconstruct an image if the data is 

25 incomplete. Video compression systems typically employ motion coding in which the 

differences between an image and the previous image are transmitted. Errors can therefore be 
propagated through a series of subsequent images. The subjective effects of packet loss depend 
on the statistical distribution of lost packets and therefore it is desirable to consider the likely 
frequency of occurrence of multiple successive lost packets when estimating subjective video 

30 quality. 
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Accordingly there is a need to provide a method of estimating subjective voice quality within a 
packet voice system that incorporates means of deterniining the loss in subjective quality due to 
a high rate of packet loss within a short time period. 

Furthennore there is a need to provide a method of estimating subjective image quality within a 
5 packet video system that incorporates means of determining the loss in subjective quality due to 
a high rate of packet loss within a short time period. 

Furthermore there is a need to provide a means of estimating subjective quality within a packet 
multimedia communications system that can determine said estimated subjective quality for 
every multimedia call in progress. 

10 Furthermore there is a need to provide a means of estimating subjective quality within a packet 
multimedia conomunications system that can detennine said estimated subjective quality for 
every multimedia call in progress and record said subjective quality within a call record 
database. 

Fxirthermore there is a need to provide a means of estimating subjective quality within a packet 
1 5 multimedia communications system that is of low implementation complexity and can be 
installed in the form of a software addition to existing Voice over IP end systems. 

SUMMARY 

The present invention provides an improved means of estimating subjective quality in packet 
multimedia commxmications systems wherein said communications system is presumed to have a 
20 low packet loss state and one or more high packet loss states and the statistical distribution of 
time spent in each state is determined thereby to predict the degradation in subjective quality 
caused by said packet loss and this information is combined with estimated degradation in 
subjective quality due to other communications system unpairments in order to provide an 
estimated subjective quality measure for said multimedia communications system. 

25 Objects and advantages 



Accordingly, besides the objects and advantages of the present invention described above, 
several objects and advantages of the present invention are: 
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(a) to provide a method for estimating the subjective or perceptual quality of a multimedia 
commimications system which considers the effects of "bursty" packet loss or short periods of 
high packet loss interspersed with periods of low or zero packet loss; 

(b) to provide a method for estimating the subjective or perceptual quality of a multimedia 
communications system which considers the relative time at which different measured network 
impairments occur; 

(c) to provide a distributed system for estimating the subjective or perceptual quality of a 
multimedia communications system at the conversion points of the multimedia communications 
system; 

(d) to provide a distributed systrai for estimating ^e subjective or perceptual quality of a 
multimedia communications system which does not increase network traffic by requiring test 
messages to be sent in order to perform said estimation process; 

(e) to provide a distributed system for estimating the subjective or perceptual quality of a 
niultimedia communications system in which the subjective or perceptual quality is estimated on 
a per call basis and incorporated into a pall record; 

(f) to provide a method for esdmating the subjective or perceptual quality of a packet voice 
connection; 

(g) to provide a method for estimating the subjective or perceptual quality of a packet video 
connection; 

Further objects and advantages are to provide a method of estimating subjective quality for an 
audio or video streaming system in which packetized audio or video is broadcast or multicast 
through a packet network and for a multimedia conferencing system. Still further objects and 
advantages will become apparent from consideration of the ensuing description and drawings. 

DRAWING FIGURES 

Embodiments of the invention (the Voice Quality Mom'tor) are described below by way 
of example only with reference to Figs. 2 to 6 of the accompanying drawings, of which: 

Fig.l shows a schematic diagram showing an environment for the present invention 



SUBSTITUTE SHEET (RULE 26) 
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Fig 2 shows a packet voice system containing several types of conversion point 

Fig. 3 shows a packet voice conversion point system containing a Voice Quality Monitor as 
described herein 

Fig. 4 shows the architecture of the Voice Quality Monitor as described herein 

5 Fig. S shows the basic structure of the Packet Loss Model subsystem of the Voice Quality 
Monitor as described herein 

Fig. 6 shows the structure of the Voice CODEC Model subsystem of the Voice Quality Monitor 
as described herein 

Fig. 7 shows a flowchart of the basic operations of the Voice Quality Monitor 
10 DESCRIPTION 

Preferred embodiment 

A preferred embodiment of the present invention is shown in Figs. 4 to S and the application of 
the present invention within a multimedia communications system is shown in Figs. 2 to 3. 

Fig. 2 shows a typical Voice over IP network and illustrates the application of the present 
1 5 invention. A Voice over IP Gateway (200) connects to an IP network (202). A telephone (201) 
connects to a Voice over IP conversion point or port (203) contained in Voice over IP Gateway 
(200). A second Voice over IP Gateway (204) connects to IP network (202). A second 
telephone (205) connects to a second Voice over BP conversion point or port (209) contained in 
Voice over IP Gateway (204). A Voice Quality Monitor (207) is embedded into port (203) of 
20 Gateway (200) and a second Voice Quality Monitor (208) is embedded into port (209) of 

Gateway (203). Voice Quality Monitors (207) and (208) gather statistics related to call (206) 
currently in process between telephone (201) and telephone (205). Service Management System 
(210) is connected to IP Network (202) and thereby to Voice Quality Monitors (207) and (208) 
&om which it is able to retrieve information. 

25 Fig. 3 shows the basic structure of a packet voice end system in which a Voice Quality Monitor 
is embedded and illustrates the application of the present invention. 
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IP Network (202) connects to a Physical Layer Interface or PHY (301) which in turn connects to 
a Medium Access Control or MAC (302) protocol layer. These terms are commonly used in the 
industry and their meaning and usage will be clear to practitioners in the field of networking. 
MAC layer (302) connects to an Internet Protocol or IP layer (303). IP layer (303) connects to a 
5 Real Time Protocol or RTP layer (304) and to a Simple Network Management Protocol or 

SNMP layer (306). RTP layer (304) connects to a Voice CODEC (310) vsiiich connects in turn 
to an Analog Voice Port (3 12) and to an attached telephone (201). SNMP layer (306) connects 
to SNMP Agent and MIB (307). MIB (307) is extended to encompass voice quality parameters 
through a connection with a Voice Quality MIB (308). A Voice Quality Monitor (309) 
10 containing a Voice Quality MIB (308) is connected to RTP layer (304) and Voice CODEC (310). 

Fig. 4 shows ttie structure of Voice Quality Monitor (309) in more detail. An input from RTP 
layer (304) is connected to a Packet Loss Model (401) and to a Jitter Model (402) and to a Delay 
Model (403). The ou^uts from Packet Loss Model (401) and from Jitter Model (402) and from 
Delay Model (403) are connected to a Combined C^ality Degradation Estimate frinction (404). 
1 5 An input from Voice CODEC (3 10) is connected to Combined Quality Degradation Estimate 
fimction (404). The output from Combined Quality Degradation Estimate fimction (404) is 
coimected to a Thresholding and History Tracking fimction (405). Threshold and History 
Tracking fimction (405) is connected to a Voice Quality MIB (406). Voice Quality MIB (406) is 
connected to SNMP MIB (307). 

20 Fig. 5 shows the structure of Packet Loss Model (401) in more detail. Irtformation on numbers 
of packets correctly received or lost is reported by RTP layer (304) at regular intervals, counted 
by a series of Counters and used to determine an estimate of voice quality degradation. The 
input from RTP layer (304) is connected to a first Counter Ngg (500) and to a second Coimter Ngi 
(501) and to a third Counter Nig (502) and to a fourth Counter Ni i (503). The outputs from 

25 Counters Ngg (500) and Ngi (501) are connected to an Estimate P12 fimction (504) which 

estimates the probability of the packet connection switching from a good state to a loss state. 
The outputs from Counters Nig (502) and Nn (503) are connected to an Estimate P21 fimction 
(505) which estimates the probability of the packet connection switching &om a loss state to a 
good state. Estimate P21 (504) and Estimate P12 (505) fimctions are connected to a Qp function 

30 (506) which determines the voice quality degradation due to the estimated packet loss 
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characteristics. Qp function (506) ou^uts the estimated voice quality degradation to Combined 
Quality Degradation Estimate function (404). 

Fig 6. shows the structure of the Voice CODE Model (400) in more detail. Information on the 
type of voice coding algorithm and certain parameters measured during a call are passed to a 
5 series of functions each of which estimate a specific effect on subjective voice qxiality. CODEC 
(310) connects to a CODEC type function (600) and to a Clippmg function (601) and to an 
Amplitude function (602) and to a Noise Level function (603) and to a Probe Result function 
(604). The output &om functions (600), (601), (602), (603) and (604) are connected to a Qv 
function (60S). The output from Qv function (605) is connected to Combined Quality 
1 0 Degradation Estimate function (404). 

Fig 7. shows a flowchart of the principal operations of Voice Quality Monitor (309). The 
sequence of steps contained in the flowchart will be explained in detail during the description of 
the operation of the present invention contained herein. The flowchart contains four sets of 
sequential operations. A &st set of operations (700) is performed at the start of a new call and 

15 essentially comprise the computation of certain parameters that will remain approximately 

constant during said call. A second set of operations (710) is performed at frequent and regular 
intervals such as 100 milliseconds and essentiaUy comprise the updating of coxmters 500 to 503. 
A third set of operations (720) is performed at regular interfaces such as 1 second and essentially 
comprise the computation of certain voice quality parameters according to the description 

20 contained herein. A fourth set of operations (730) is performed on receipt of a request from 
Service Management System (210) and at the termination of a call. 

Description of Operation 

In order to make the operation of the present invention clear and apparent it is useful to briefly 
outline the operation of the Voice over IP conversion point with reference to Figs. 2 and 3. 

25 According to tiie diagram shown in Fig. 2 Telephone (201) places a long distance call to 

Telephone (205). The analog voice path from Telephone (201) is connected to Voice over IP 
conversion point (203) contained in Voice over IP Gateway (200). Dialing information from 
telephone (201) causes Voice over IP Gateway (200) to estabUsh a connection (206) through IP 
Network (202) to Voice over IP conversion point (209) contained in Voice over IP Gateway 

30 (204) and thereby to telephone (205). When connection (206) is established voice 
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communicatioiis between Telephone (201) and Telephone (205) is provided by means of a 
packet voice connection between Voice over IP conversion point (203) and Voice over IP 
conversion point (209). 

With reference to Fig. 3, for voice packets received from IP Network (202) - denoted the 
5 receiving direction. An arriving voice packet is received from IP Network (202) and passed 
through the PHY layer (301) and MAC layer (302) to the IP layer (303). Packets are identified 
to permit IP layer (303) to determine wh^e to route each packet. The arriving voice packet is 
passed to the RTP layer (304) at which point the contents of the packet, comprising a digitized 
compressed voice segment, is passed to Voice CODEC (310). RTP layer (304) identifies a time 
1 0 indicator in each received packet by which means it is able to assemble the packets into a proper 
sequence and determine if packets are delayed or missing. RTP layer (304) counts packets 
received and missing packets and determines at least the average, maximum and minimum 
delays. Voice CODEC (310) uncompresses or decodes the received voice segment, converts it 
back to analog form and plays it as an audible signal to Telephone (201). 

1 5 The operation of the preferred embodiment of the present invention will now be explained with 
reference to Figs. 3 to 7. 

At the start of a new call Voice Quality Monitor (309) copies certain parameters from Voice 
CODEC (3 10) into CODEC type (600) including the type of CODEC in use, for example 
G.729A or G.723.1, and the jitter buffer level and the packet replacement algorithm. If Voice 

20 CODEC (3 10) conducts an initial line quality test or training procedure then the results of this 
line probing procedure are copied by Voice CJuality Monitor (309) into Probe result (604). 
During the Voice over IP call establishment procedure the Voice over IP conversion point (203) 
sends and receives mitial call setup packets and is thereby able to measure the round trip delay of 
the connection through IP Network (202). Voice Quality Monitor (309) copies flie measured 

25 round trip delay from Voice over IP conversion point (203) to Delay Model (403). Voice 

Quality Monitor (309) resets all other variables and counters to zero, including Coimters (500), 
(501), (502), (503). These steps are shown in Fig. 7 steps 701 to 704. 

At regular and frequent intervals, for example 100 milliseconds. Voice Quality Monitor (309) 
performs the following actions (shown in Fig. 7 steps 71 to 714). 
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(a) reads firam RTP layer (304) R the count of the numbers of packets received and X the 
number of packets lost Counters (500), (501), (502) and (503) are updated according to 
the following procedure: 



If(R-X)>5 andX< 2 then 

if state ^GthenNgg ^Ngg^l 
if state = L then Nig ^Ngg-^-J 
state = G 

else 

if state ^GthenNgj=-Ngj + 1 
if state = L then Nij = iV/y + 1 
state = L 
end if 



{Low loss rate) 
{Counter (500)} 
{Counter (502)} 

{high loss rate} 
{Counter (501)} 
{Counter (503)} 



5 (b) reads from RTP layer (304) the current average Javg jitter level expressed as an integer 

number of milliseconds and stores said parameters within Jitter Model (402) 

(c) requests RTP layer (304) to reset its coimters 

(d) reads from Voice CODEC (310) the background noise level expressed as a 
percentage of the maximum possible value and stores this parameter within Noise Level 

10 (603), and also reads the peak received voice amplitude expressed as a percentage of the 

maximum possible value and stores this percentage within Amplitude (602), and also 
determines if the received voice amplitude has reached the maximum allowable level and 
if so increments counter Clipping (601). 

At regular intervals, for example 1 second. Voice Quality Monitor (309) computes an estimate of 
.15 the received voice quality based on the statistics gathered during the preceding interval. The 

procedure for computing said estimate comprises the following five steps (shown in Fig. 7 steps 
721 to 725): 

(i) Voice CODEC quality degradation (400) is computed using the following 
procedure: 
Define Base Quality as BMOS 
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If CODEC type = G. 723 J then BMOS = 4 
If CODEC type = G. 7 29 A then BMOS ^4,1 
BMOS = BMOS- Clipping/20 
If Amplitude < 20% then BMOS = BMOS - 0.25 
BMOS = BMOS ' Noise level/200 



{CODEC type (600)} 

{Clipping (601)} 
{Amplitude (602)} 
{Noise level (603)} 



(ii) Packet Loss quality degradation (401) is computed using the following procedure: 

Pi2 = Ngi / (Ngg + Ngi) {Pi2(504), Ngg (500), Ngi (501)} 

P21 =Nig/(Nii+Nig) {P2i(505),Nig(502),Nii(501)} 
Qp = 5Pi2(2-P2i) {Qp(506)} 

(iii) Voice quality degradation due to Jitter (402) is computed usiag the following 
procedure: 

= (J + 0.025 J^) / 500 {J is Jitter m milliseconds} 



(iv) Voice quality degradation due to Delay (403) is computed using the following 
procedure 

Qd = 0.0018 D + (D / 1000)^ {D is delay in milliseconds} 



(v) Combined Voice Quality Degradation estimate (404) is computed using the 
following procedure: 
Q = BMOS-Qp-Qj-(b 
IfQ<OthenQ = 0 

The computed voice quality value Q is transferred to Thresholding and History Tracking 
function (405). The value of Q is stored within a historical list of predefined length contained 
within Voice Quality MIB (308) to permit later retrieval by Service Management System (210). 
The mavtiTnim^ MaxQ, and fniniTnuTn^ MinQ, and avemge, AvQ, values of Q are determined 
within Voice (Quality MIB (308) accordmg to the following procedure: 

Sample count N = N + 1 
TotalQ = TotalQ + Q 
AvQ = TotalQ/N 
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If Q > MaxQ thenMaxQ = Q 
If Q < MinQ then MinQ = Q 

An Integrator R contained within Threshold and History Tracking function (405) is also i5)dated 
according to the following procedure: 

K is initialized to 100 on call establishment 
K=4-Q 

IfK<OthenK = 0 
R = R.6K^ 

IfR<100thenR = R+l 

5 The value of R is compared to a tibreshold value and an indication transferred to SNMP Agent 
(307) shoxild the value of R be below said threshold value, according to Fig. 7 steps 73 1 and 734. 
In the preferred embodiment the value of said threshold value may be modified by Service 
Management System (210) and is by de&ult 20. 

At any time during a call Service Management System (2 1 0) may transmit an SNMP message to 
1 0 SNMP Agent (307) to request an immediate report of voice quality. On receipt of such a 

message SNMP Agent (307) reads the current values of maximum, minimum and average Q, and 
R voice quality estimates and incorporates these values into an Shj^MP message which is 
transmitted back to Service Management System (210) according to Fig. 7 st&ps 732 and 734. 

At any time during a call Voice Quality Monitor (309) may request RTP layer (304) to insert a 
1 5 representation of current voice quality parameters Q and R into a transmitted voice packet in 
order that congestion management functions within IP Network (202) can be continually 
informed of voice quality without requiring additional packets to be sent. 

At the completion of a call the Average and Minimum values of voice quality are transferred 
torn Voice Quality MIB (308) into a call completion packet for transmission to Service 
20 Management System (2 1 0) wherein these values are retained in a database for later use. 

As may be seen from the preceding description of operation the present invention provides a 
comprehensive estimate of voice quality without adding excessive complexity to the Voice over 
IP system and is accordingly economical to implement and deploy within such syst&ms. 



wo 01/80492 



14 



PCT/USOl/40499 



Alternative embodimeiits 

Other variations of a Voice Quality Monitor which achieve a similar result to the preferred 
^bodiment are described below as illustrative of the wide variety of implementations that fall 
within the purview of the present invention. 

Other packet loss models that can be employed to represent the possible states of the packet 
connection include, but are not limited to: 

(i) A 2-state Markov process in vAdch one state represents a low packet loss rate and the 
other state a high packet loss rate; 

(ii) A 3-state Markov process in which one state represents a zero packet loss mte, a second 
state represents a low non-zeto packet loss rate and a third state represents a high loss 
rate; 

(iii) A probabilistic automata based learning model in vMch a higher order Markov model is 
dynamically constructed; 

(iv) A renewal process which attempts to model the sequences of correctly received and lost 
packets using a Pareto, Hyperbolic or similar distribution; 

(v) A neural network based algorithm which learns fiom the observed network impairments 
to predict the continuous behavior of the packet connection. 

The parameters of the packet loss model and other measured impairments may be transmitted to 
a central management system for conversion to a voice quality factor. _ 

Conclusion, Ramifications and Scope 

One braefit of the present invention is tluit the statistical properties of impairments can be 
properly considered vAien estimating voice quality. For example, if the packets are lost in 
groups or sequences the subjective impact is greater than if packet losses occur individually. The 
Voice Quality Monitor is able to determine information about the distribution of packet loss and 
to use this information when estimating subjective voice quality. 

A further benefit of the present invention is that the impairments that affect voice quality can be 
correlated to give a more complete picture of user perceived quality. For example, if jitter and 
packet loss occur within the same time period then this results in a diJSerent subjective quality 
than if the impairments occur within different time periods. 
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A further benefit of the present invention is that the monitoring of voice quality can occur 
continuously at every conversion point in the network, whereas prior art systems adopt a 
sampling approach and can only provide a limited picture of the network performance. 

Alttiough the description above contains many specificities these should not be construed as 
5 limiting the scope of the uivention but as merely providing illustrations of some of the presently 
preferred embodiments of tiiis invention. For example, the packet network may use the Internet 
Protocol (BP), Asynchronous Transfer Mode (ATM), Frame Relay or other connection oriented 
or connectionless networking protocols and may use copper wire, optical fiber, wireless or other 
physical transmission media. The invention may be employed within a cellular telephone system 
1 0 with the voice quality monitor described herein located within the cellular telephone handset. 
The invention may be applied to synchronized streams of multimedia data and may for example 
detect loss of audio-video synchronization as an impairment The invention may also be xised in 
other applications tihan multimedia communications, for example any client-server application in 
which the efficiency and responsiveness of the communication between the client and the server 
15 is affected by the burstmess or non-uniformity in the time distribution of packet loss and other 
network impairments. The detailed description contains references to counters that are increased 
or decreased upon certain conditions and fbs same result may be achieved by using counters that 
are decreased or increased upon tiiese same conditions. 

Thus the scope of the invention should be determined by the appended claims and their legal 
20 equivalents, rather than by the examples given. 
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CLAIMS: 
I claim: 

1 . A method for automatically estimating tiie subjective quality of a multimedia signal 
transmitted over a packet connection, comprising the steps of: 

S (a) determining the statistical distribution of the time spent by said packet connection in 

at least a low packet loss probability state and a high packet loss probability state, and 
(b) determining the estimated effect of said statistical distribution on the subjective 
quality of said multimedia signal. 

2. A method as defined in Claim 1 , further comprising the steps of: 

10 (a) determining the level of one or more impairments, said impairments being selected 

from the group consisting of jitter and delay and packet error rate and CODEC 
distortion, and 

(b) determining the estimated effect of said impairments on the subjective quality of said 
multimedia signal. 

15 3. A method as defined in Claim 2, fiirther comprising the step of reporting said estimated 

subjective quality to a central database system following the termination of said multimedia 
signal. 

4. A method as defined in Claim 2, further comprising the steps of comparing said estimated 
subjective quality to a threshold and sending an event message to a central management 

20 system if said subjective quality is below said threshold. _ 

5. A method as defined in Claim 2, further comprising Ihe insertion of a representation of the 
estimated subjective quality of the received multimedia signal into transmitted voice packets. 

6. A method as defined in Claim 2, fiirther comprising the periodic updating of a counter 
\^erein: 

25 (i) when the estimated subjective quality is low, said counter is reduced by an amount 

dependent on said subjective quality, and 
(ii) when the estimated subjective quality is high, said counter is increased. 

7. A system for automatically estimating the subjective quality of a plurality of multimedia 
signals transmitted over a packet network wherein each multimedia signal connects two or 

30 more multimedia signal to packet conversion points and one or more of said conversion 
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points are physically grouped within an enclosure, comprisuig a plurality of quality 
monitoring functions wherein: 

(a) each of said functions monitors one of said conversion points and is contained within 
the same enclosure as said conversion point; 

(b) each of said functions estimates the subjective quality resulting from the conversion 
of received packets to a multimedia signal performed by said conversion point. 

8. A system as defined in Claim 7, wherein the quality monitoring function 

(a) determines the statistical distribution of the time spent by said packet connection in at 
least a low packet loss probability state and a high packet loss probability state, and 

(b) detennines the estimated eflFect of said statistical distribution on the subjective quality 
of said multimedia signal. 

9. A system as defined in Claim 8, wherein the quality monitoring function: 

(a) detennines the level of one or more impairments, said impairments being selected 
from the group consisting of jitter and delay and packet error rate and CODEC 
distortion, and 

(b) determines the estimated effect of said impakments on the subjective quality of said 
multimedia signal. 

10. A system as defined in Claim 8, wherein tiie quality monitoring function reports the 
estimated subjective qoatity of the multimedia signal to a central database system following 
the termination of said multimedia signal. 

1 1 . A system as defined in Claim 8, wherein the quality monitoring function compares the 
estimated subjective quality of the multimedia signal to a threshold and sends an event 
message to a central management system if said subjective quality is below said threshold. 

12. A system as defined in Claim 8, wherein the quality monitoring function inserts a 
representation of the estimated subjective quality of the received multimedia signal into 
transmitted voice packets. 

13. A system as defined in Claim 8, wh^ein the quality monitoring function contains a counter 
wherein; 

(a) when the estimated subjective quality is low, said counter is reduced by an amount 
dependent on said subjective quality, and 

(b) when the estimated subjective quality is high, said counter is increased. 
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